NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

VL-TGS: Trajectory Generation and Selection Using Vision Language Models in Mapless Outdoor Environments

https://doi.org/10.1109/LRA.2025.3559822

Song, Daeun; Liang, Jing; Xiao, Xuesu; Manocha, Dinesh (April 2025, IEEE Robotics and Automation Letters)

We present a multi-modal trajectory generation and selection algorithm for real-world mapless outdoor navigation in human-centered environments. Such environments contain rich features like crosswalks, grass, and curbs, which are easily interpretable by humans, but not by mobile robots. We aim to compute suitable trajectories that (1) satisfy the environment-specific traversability constraints and (2) generate human-like paths while navigating on crosswalks, sidewalks, etc. Our formulation uses a Conditional Variational Autoencoder (CVAE) generative model enhanced with traversability constraints to generate multiple candidate trajectories for global navigation. We develop a visual prompting approach and leverage the Visual Language Model's (VLM) zero-shot ability of semantic understanding and logical reasoning to choose the best trajectory given the contextual information about the task. We evaluate our method in various outdoor scenes with wheeled robots and compare the performance with other global navigation algorithms. In practice, we observe an average improvement of 20.81% in satisfying traversability constraints and 28.51% in terms of human-like navigation in four different outdoor navigation scenarios.
more » « less
Free, publicly-accessible full text available April 10, 2026
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement

Bae; Jaesung; Kuznetsova, Anastasia; Manocha, Dinesh; Hershey, John; Kristjansson, Trausti; Kim, Minje (April 2025, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW): Generative Data Augmentation for Real-World Signal Processing Applications (GenDA 2025))

Free, publicly-accessible full text available April 7, 2026
VLM-Social-Nav: Socially Aware Robot Navigation Through Scoring Using Vision-Language Models

https://doi.org/10.1109/LRA.2024.3511409

Song, Daeun; Liang, Jing; Payandeh, Amirreza; Raj, Amir Hossain; Xiao, Xuesu; Manocha, Dinesh (January 2025, IEEE Robotics and Automation Letters)

We propose VLM-Social-Nav, a novel Vision-Language Model (VLM) based navigation approach to compute a robot's motion in human-centered environments. Our goal is to make real-time decisions on robot actions that are socially compliant with human expectations. We utilize a perception model to detect important social entities and prompt a VLM to generate guidance for socially compliant robot behavior. VLM-Social-Nav uses a VLM-based scoring module that computes a cost term that ensures socially appropriate and effective robot actions generated by the underlying planner. Our overall approach reduces reliance on large training datasets and enhances adaptability in decision-making. In practice, it results in improved socially compliant navigation in human-shared environments. We demonstrate and evaluate our system in four different real-world social navigation scenarios with a Turtlebot robot. We observe at least 27.38% improvement in the average success rate and 19.05% improvement in the average collision rate in the four social navigation scenarios. Our user study score shows that VLM-Social-Nav generates the most socially compliant navigation behavior.
more » « less
Full Text Available
Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation

Lin, Jackie; Götz, Georg; Llopis, Hermes Sampedro; Hafsteinsson, Haukur; Guðjónsson, Steinar; Nielsen, Daniel Gert; Pind, Finnur; Smaragdis, Paris; Manocha, Dinesh; Hershey, John; et al (April 2025, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW): Generative Data Augmentation for Real-World Signal Processing Applications (GenDA 2025))

Free, publicly-accessible full text available April 7, 2026
DTG : Diffusion-based Trajectory Generation for Mapless Global Navigation

https://doi.org/10.1109/IROS58592.2024.10802055

Liang, Jing; Payandeh, Amirreza; Song, Daeun; Xiao, Xuesu; Manocha, Dinesh (October 2024, IEEE)

We present a novel end-to-end diffusion-based trajectory generation method, DTG, for mapless global navigation in challenging outdoor scenarios with occlusions and unstructured off-road features like grass, buildings, bushes, etc. Given a distant goal, our approach computes a trajectory that satisfies the following goals: (1) minimize the travel distance to the goal; (2) maximize the traversability by choosing paths that do not lie in undesirable areas. Specifically, we present a novel Conditional RNN(CRNN) for diffusion models to efficiently generate trajectories. Furthermore, we propose an adaptive training method that ensures that the diffusion model generates more traversable trajectories. We evaluate our methods in various outdoor scenes and compare the performance with other global navigation algorithms on a Husky robot. In practice, we observe at least a 15% improvement in traveling distance and around a 7% improvement in traversability. Video and Code: https://github.com/jingGM/DTG.git.
more » « less
Full Text Available
AG-Cvg: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV

https://doi.org/10.1109/ICRA57147.2024.10610339

Karapetyan, Nare; Asghar, Ahmad Bilal; Bhaskar, Amisha; Shi, Guangyao; Manocha, Dinesh; Tokekar, Pratap (May 2024, IEEE)

Full Text Available
MTG: Mapless Trajectory Generator with Traversability Coverage for Outdoor Navigation

https://doi.org/10.1109/ICRA57147.2024.10611319

Liang, Jing; Gao, Peng; Xiao, Xuesu; Sathyamoorthy, Adarsh Jagan; Elnoor, Mohamed; Lin, Ming C; Manocha, Dinesh (May 2024, IEEE)

We present a novel learning-based trajectory generation algorithm for outdoor robot navigation. Our goal is to compute collision-free paths that also satisfy the environment-specific traversability constraints. Our approach is designed for global planning using limited onboard robot perception in mapless environments while ensuring comprehensive coverage of all traversable directions. Our formulation uses a Conditional Variational Autoencoder (CVAE) generative model that is enhanced with traversability constraints and an optimization formulation used for the coverage. We highlight the benefits of our approach over state-of-the-art trajectory generation approaches and demonstrate its performance in challenging and large outdoor environments, including around buildings, across intersections, along trails, and off-road terrain, using a Clearpath Husky and a Boston Dynamics Spot robot. In practice, our approach results in a 6% improvement in coverage of traversable areas and an 89% reduction in trajectory portions residing in non-traversable regions. Our video is here: https://youtu.be/3eJ2soAzXnU
more » « less
Full Text Available
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning

Chakraborty, Souradip; Bedi, Amrit; Koppel, Alec; Wang, Huazheng; Manocha, Dinesh; Wang, Mengdi; Huang, Furong (January 2024, The Twelfth International Conference on Learning Representations)
GAMEOPT: Optimal Real-time Multi-Agent Planning and Control for Dynamic Intersections

https://doi.org/10.1109/ITSC55140.2022.9921968

Suriyarachchi, Nilesh; Chandra, Rohan; Baras, John S; Manocha, Dinesh (October 2022, Proceedings)

Full Text Available
Grasping Fragile Objects Using A Stress-Minimization Metric

Pan, Zherong; Gao, Xifeng; Manocha, Dinesh (January 2020, International Conference on Robotics and Automation (ICRA 2020))

We present a new method to generate optimal grasps for brittle and fragile objects using a novel stressminimization (SM) metric. Our approach is designed for objects that are composed of homogeneous isotopic materials. Our SM metric measures the maximal resistible external wrenches that would not result in fractures in the target objects. In this paper, we propose methods to compute our new metric. We also use our SM metric to design optimal grasp planning algorithms. Finally, we compare the performance of our metric and conventional grasp metrics, including Q1, Q∞, QG11, QMSV , QV EW . Our experiments show that our SM metric takes into account the material characteristics and object shapes to indicate the fragile regions, where prior methods may not work well. We also show that the computational cost of our SM metric is on par with prior methods. Finally, we show that grasp planners guided by our metric can lower the probability of breaking target objects.
more » « less
Full Text Available

« Prev Next »

Search for: All records